-
Notifications
You must be signed in to change notification settings - Fork 3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Documentation update, added info about the cloud storage UI and add AWS-S3 tutorial #3745
Conversation
Co-authored-by: Timur Osmanov <54434686+TOsmanov@users.noreply.github.com>
### Upload dataset | ||
|
||
Initially you need to create a manifest file for your image dataset. Information on how to do that is available | ||
on the [Simple command line to prepare dataset manifest file](/docs/manual/advanced/dataset_manifest) page. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@TOsmanov , let's repeat here basic steps to upload a simple dataset. Need exact instructions. Let's take a video file (take a link from serverless tutorial), prepare a manifest and upload them.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@nmanovic , as far as I understand cloud storage into UI does not support the use of video file, do you want me to describe the division of the video into frame sequence?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@Marishka17 , could you please comment on that?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@TOsmanov , OK. Let's use a dataset with images in the case.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Great work!
[script for prepare manifest file](https://github.com/openvinotoolkit/cvat/tree/develop/utils/dataset_manifest). | ||
Perform the installation using the manual [aws-shell](https://github.com/awslabs/aws-shell), | ||
I used `aws-cli 1.20.49` `Python 3.7.9` `Windows 10`. | ||
You can configure credentials by running. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You can configure credentials by running. | |
You can configure credentials by running `aws configure`. |
![](/images/aws-s3_tutorial_7.jpg) | ||
|
||
### Prepare manifest file | ||
Now you need to prepare manifest file. I used [AWS cli](https://aws.amazon.com/cli/) and |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Now you need to prepare manifest file. I used [AWS cli](https://aws.amazon.com/cli/) and | |
Now you need to prepare a manifest file. I used [AWS cli](https://aws.amazon.com/cli/) and |
### Create user and configure permissions | ||
|
||
To access bucket you will need to create a user, to do this, go [IAM](https://console.aws.amazon.com/iamv2/home#/users) | ||
and click `add users`. You need to choose AWS access type have access key ID and secret access key. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
and click `add users`. You need to choose AWS access type have access key ID and secret access key. | |
and click `add users`. You need to choose AWS access type, have an access key ID and secret access key. |
You'll be taken to the bucket creation page. Here you have to specify the name of the bucket, region, | ||
optionally you can copy the settings of another bucket by clicking on the `choose bucket` button. | ||
Checkbox block all public access can be enabled as we will use `access key ID` and `secret access key` to gain access. | ||
In the following sections you can leave the default settings and click `create bucket`. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In the following sections you can leave the default settings and click `create bucket`. | |
In the following sections, you can leave the default settings and click `create bucket`. |
@TOsmanov , Could you please also prepare the same tutorial for Azure in another PR? |
@Marishka17 Okay, I'll add |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Great job!
Motivation and context
Depended #3724
Issues:
It may be worth adding support for S3 inventory in CVAT. In the S3 inventory contains paths to files with extension, and you can also add an additional field
ETag
which can be an MD5 digest of the object data.name
parameter has\\
Instead of/
. For fix this, I used.replace
incore.py
file, perhaps there is a better solution.Preview:
https://tosmanov.github.io/cvat/docs/manual/basics/attach-cloud-storage/
How has this been tested?
Checklist
develop
branchcvat-core, cvat-data and cvat-ui)
License
Feel free to contact the maintainers if that's a concern.